Digital Signal Processing on MMXTM Technology

نویسنده

  • Yen-Kuang Chen
چکیده

Algorithmic-level optimization and programming-level optimization are tightly coupled with each other. Many programmers can optimize the implementation of a specific algorithm using MMX™ technology. However, without algo-rithmic-level optimization, the speed-up of the optimization will be limited. On the other hand, many algorithm developers can optimize the DSP algorithm in terms of the numbers of operations (multiplications or additions). Nonetheless, without implementation details, the number of operations cannot be directly translated into the number of clock cycles spent in CPU. Moreover, many algorithms can accomplish the same task. For best performance of DSP/multimedia applications on personal computers, we should consider algorithm-MMX co-optimization. One way to increase performance of digital signal processing is to execute several computations in parallel. MMX is one of the techniques that speed up software performance by performing the same operation on multiple data elements in parallel using a single instruction. However, MMX programming and designing DSP algorithms for MMX are full of twists and turns. Implementation of digital signal processing using MMX technology is a mix of a science and an art. Matching the algorithms to MMX instruction capabilities is the key to extracting the best performance. This chapter is covering algorithm design and algorithmic-level optimization for MMX. In this chapter, besides showing you how to optimize your code and algorithm-from a science view, we will show you how we go about optimizing ours-from an art perspective.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Implementation of Digital Demodulator for Frequency Modulated CW Radar (RESEARCH NOTE)

Radar Signal Processing has been an interesting area of research for realization of programmable digital signal processor using VLSI design techniques. Digital Signal Processing (DSP) algorithms have been an integral design methodology for implementation of high speed application specific real-time systems especially for high resolution radar. CORDIC algorithm, in recent times, is turned out to...

متن کامل

Automatic Detection and Localization of Surface Cracks in Continuously Cast Hot Steel Slabs Using Digital Image Analysis Techniques

Quality inspection is an indispensable part of modern industrial manufacturing. Steel as a major industry requires constant surveillance and supervision through its various stages of production. Continuous casting is a critical step in the steel manufacturing process in which molten steel is solidified into a semi-finished product called slab. Once the slab is released from the casting unit, th...

متن کامل

Transition from analogue-to-digital nuclear electronics for precise gamma radiation spectroscopy: advantages and limitations

In this paper, digital signal processing for precise gamma ray spectroscopy is presented. The basis of this system is a 14bit waveform digitizer which samples the output of pre-amplifier signals directly. The advantages and limitations of the digital spectrometer compared with analog spectrometer has been tested and analyzed from precise gamma ray spectroscopy point of view. The results shows t...

متن کامل

Implementation of a high-quality Dolby Digital decoder using MMX TM technology

Dolby* Digital is a high-quality audio compression format widely used in feature films and, more recently, on DVD. PCs now offer DVD drives, and providing a Dolby Digital decoder in software allows decoding of Dolby Digital to become a baseline capability on the PC. Intel’s MMXTM technology provides instructions that can significantly speed up the execution of the Dolby Digital decoder, freeing...

متن کامل

Digital Binary Phase-shift Keyed Signal Detector

We have developed the effective algorithm for detecting digital binary phase-shift keyed signals. This algorithm requires a small number of arithmetic operations over the signal period. It can be relatively easy implemented based on the modern programmable logic devices. It also provides high interference immunity by identifying signal presence when signal-to-noise ratio is much less that its w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000